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Abstract: Corrected confidence intervals are developed for the mean of the 
second component of a bivariate normal process when the first component is 
being monitored sequentially. This is accomplished by constructing a first ap- 
proximation to a pivotal quantity, and then using very weak expansions to 
determine the correction terms. The asymptotic sampling distribution of the 
renormalised pivotal quantity is established in both the case where the covari- 
ance matrix is known and when it is unknown. The resulting approximations 
have a simple form and the results of a simulation study of two well-known 
sequential tests show that they are very accurate. The practical usefulness of 
the approach is illustrated by a real example of bivariate data. Detailed proofs 
of the main results are provided. 



1. Introduction 



Suppose that a sequential test is carried out to compare two treatments. Then, 
following the test, there is interest in making valid inferences about the different 
parameters. For example, the primary parameter will typically be some measure 
of the treatment difference and there may be several secondary parameters too. 
These could be the individual treatment effects or the effects of baseline covariates, 
such as age, gender, disease stage, and so on. However, the use of a sequential de- 
sign leads to the usual maximum likelihood estimators being biased and associated 
confidence intervals having incorrect coverage probabilities. One approach to the 
estimation problem is to study the approximate bias and sampling distributions of 
the maximum likelihood estimators. 

Until recently, much of the research on estimation following sequential tests fo- 
cussed on primary parameters. For example, a n ap proach based on approximately 
pivotal quantities was developed by Woodroofe [2J] in the context of a single stream 
of normally distributed observations. Here, interest lay in providing an approximate 
confidence interval for a mean. The work in the present paper extends this approach 
in several respects. We consider bivariate normal data, where interest lies in esti- 
mating the mean of the second component of the process when the first is being 
monitored sequentially. Further, we consider the case of an unknown covariance 
matrix for the process. 
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One of the first papers to address the problem of estimation of secondary pa- 
rameters following a sequential test was For large samples, he showed how the 
bias of the estimator of the secondary parameter is related to that of the primary 
parameter, and then showed how a bias-adjusted estimator of the secondary pa- 
rameter could be constructed. Gorfine has shown [7J how a theorem of Yakir 27 1 
may be used to define an unbiased estimator of the secondary parameter. Related 
work has been carried out by Liu and Hall 11 1. More recently, Hall and Yakir 0] 
develop tests and confidence procedures in a very general context. 

Several authors have developed methods for the construction of confidence in- 
tervals based on approximately pivotal quantities. Whitehead, Todd and Hall show 



2l| how approximate confidence intervals may be obtained for a bivariate normal 



process when the covariance matrix is known and then show how these may be 
applied to problems in which approximate bivariate normality can be assumed. Liu 
considers 12j a similar problem and shows how the appropriate corrections may 
be obtained using moment expansions, though the method developed appears to 
be somewhat restricted. In the present paper, we consider both the known and the 
unknown covariance matrix cases. 

The approximately pivotal quantities are constructed by considering the bivari- 
ate version of the signed root transformation, and then using a version of Stein's [lol ] 
identity and very weak expansions to determine the correction terms. The results 
in the known covariance matrix case are obtained by applying those of Weng and 
Woodroofe 17| for the two-parameter exponential family. In the unknown covari- 
ance matrix case, similar ideas to those used by Weng and Woodroofe [l8| in the 
context of stationary autoregressive processes are used to establish the asymptotic 
sampling distribution of the renormalised pivotal quantity. The resulting correction 
terms have a simple form and complement the results of Whitehead [19| • 

In Section 2, the correction terms for the usual approximately pivotal quantity 
in the known covariance matrix case are determined using results for the two- 
parameter exponential family and it is shown how these may be used to construct 
corrected confidence intervals for the secondary parameter. The case of an unknown 
covariance matrix is then considered in Section 3, and the asymptotic sampling 
distribution of the renormalised pivotal quantity is obtained. The results of a sim- 
ulation study of two well-known sequential tests are reported in Section 4 and a 
real example of bivariate data is used to illustrate the approach in Section 5. Some 
remarks and an indication of possible extensions to the present work are given in 
Section 6. Appendices contain detailed proofs of the main results. 



2. Bivariate normal process with known covariance matrix 

2. 1 . The general method for two-parameter exponential families 

Let Xj = (Xij, X2j)' for j = 1, . . . ,n be independent random vectors distributed 
according to a two-parameter exponential family with probability density 

where 9 = (9\, 62)' G £1 and Q is the natural parameter space, assumed to be open. 
Let L n (9) denote the log- likelihood function based on X\±. . . , x n , and consider the 
bivariate version of the signed root transformation (e.g. |4|) given by 



(1) 



Zni = Z nl (9) = v /2{i„(0 n )-L n (0 n )}sign(0 1 - 9 nl ) 
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and 

(2) Z n2 = Z n2 {9) = ^2{L n (9 n )-L n {9)}sign(9 2 - 9 n2 ), 

where 9 n = (9 n i,9 n2 )' is the maximum likelihood estimator and 9 n — (^1,^2)' is 
the restricted maximum likelihood estimator for fixed 9\. Then we have L n {9) = 
Ln(On) - \\Z n \\ 2 /2, where Z n = (Z nl ,Z n2 )'. 

Consider a Bayesian model in which 9 has a prior density £ with compact support 
in Q. Let denote expectation in the Bayesian model in which 9 is replaced 
with a random vector Q and let E£ denote conditional expectation given {Xj,j — 

1, . . . , n}. Then the posterior density of 9 given X\, . . . , X n is £ n (9) oc e Ln ^((9), 
and the posterior density of Z n is 

(3) £n(z) oc J{kMn{6) oc J(9 n ,9)aO)e~ hM \ 

where z and 9 are related by (P} and and J is a Jacobian term. From ©, 

(4) Cn(«) = fn(z)<h(z), zeti 2 , 

where 4> 2 denotes the standard bivariate normal density and 

f n (z)txJ0 n> e)m- 

Now, for p > 0, let H p be the set of all measurable functions h : 5ft 2 — > 5ft for 
which \h(z)\/c < 1 + ||z|| p for some c > 0, and let H = U p >oH p . So, Ho denotes the 
set of bounded functions. Let & k denote the standard /c-variate normal distribution 
for k = 1, 2 and write 



r/i = / hdT 

for an arbitrary measure T. Given h £ H p , let ho = <J> 2 /i, h 2 = h and 
and 



(5) 



51(2/1,2/2) = e^i / -/i }e ^ 2 du;, 



1 „,,2 



32(2/1,2/2) = e 2 ^ 2 / {/i2(yi, w) - /ii(yi)}e 2 ™(i!c 



for -00 < 2/1,2/2 < 00 ■ Then let = (31,32)' and V/i = (L7 2 /i + U 2 h')/2, where 
C7 2 /i is the 2x2 matrix whose zth column is Ugi and §i is as in ([5|). Lemma [T] below 
follows from Lemma 1 of Weng and Woodroofe [T3| • 

Lemma 1 (Stein's identity). Let r be a nonnegative integer. Suppose that dT = 
fdQ 2 , where f is twice differentiate on 5ft 2 for which 

\f\ d &+ f (i + \\z\n\\wf(z)\\^(dz)< (X 

and 

'' (l + ||z|ni|V 2 /(z)||$ 2 (dz)<cx). 
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Then 



Th = Tl-<$> 2 h + <5> 2 (UK)' { V/(z)$ 2 (dz)+ / tr{(V h)V 2 f}d<S> 2 

J® 2 Jst 2 

for all h £ H r . 

From ((4]) , the posterior distributions of Z n are of a form appropriate for Stein's 
identity. Let 

and 

I^(§ n ,6>)=n- V2/n(Zn) 



fn{Z n ) 

Now let -B ra denote the event {#„ e V&(f2)} and let So denote the collection of all 
prior densities £ = with compact support in for which £ is twice differentiable 
almost everywhere under P^, and V 2 £ is bounded on its sup port. Proposition [2] 
below follows from Proposition 2 of Weng and Woodroofe 17| . 



Proposition 2. Suppose that £ € So- Then 

1 



= &h + -=(&uhyE?{ri(o n ,e)} + -ti[E?{vh{z n )v\ 



(0n,fl)}] 



almost everywhere on B n , for all h £ H . 



Let iV = iV Q be a family of stopping times, depending on a design parameter 
a > 1. Suppose that 

in -probability for almost every 9 £ fi, where p is a continuous function on O. 
Suppose also that, for every compact set K C fi, there is an 77 > such that 

(6) Pe(N a < na) = o^ 9 ), 

uniformly with respect to 9 € K as a — > 00, for some q > 1/2. Lemma 3 below 
follows from Theorem 12 of Weng and Woodroofe [17|. Moreover, by their Lemma 
5 and © above, we have Pg(B c N ) = o(l/a). 

Lemma 3. The random vector Zn = {Zn\,Zn2)' is uniformly integrable with 
respect to P^. 

In what follows, suppose that 9\ is the primary parameter and that #2 is a 
nuisance parameter. Then, for inference about 9\, it is appropriate to use Z^i- 
Now, from Proposition 2, 

E^hiz^j^^h+^i^uhjE^ri^N^^+^E^ivhiz^rl^^)}. 



To determine the mean correction term for Z^i, we take h(z) = z, in which case 
= 0, ^Uh — 1 and Vh(z) = 0. Similarly, for the variance correction term, we 
take h(z) = z 2 , in which case <& 1 ft, = 1, = and Vh(z) — 1. Denote by fry 

the partial derivatives hj(9) — d l+: >b(9) / d9\d9 3 2 , and similarly for £y. Let = 
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(620-^1/602)^), 12(e) = 6 O2 (0), Ti tl (9,9) = lim^rf^w^) and r| iU (0,0 
lim^^e r| n (w, 0), and let k(0) and m(0) be such that 

(7) £^(0)1^(0,0)}= / / e(0) K (0)d0id0 2 
and 

(8) ^VWl n(0, 0) - 2p(0) K (<?)r« x (0, 0) + k 2 (9)} = f f m(8)a0)d0id6 2 

J Jn 

Then some algebra yields 

(9) 6o2Zl 

+ P(g) ( ^"^V ) + ^'"^ Vfa (4 ■ 

A similar, but more complicated expression, may also be obtained for m(ff). 
Now, define 

U U J jv l(o) ' 



N 



where 



k N /^a~ if |Ajvj < a 1 / 6 {log(a)}- 1 , 

(11) = { a- 1/3 {log(a)}- 1 if kjv > ^{logfo)}" 1 , 

-a- 1 / 3 {log(a)}- 1 iiK W < -a 1/6 {log(a)}- 1 , 



and 



/ 12 ) f C°) = J \A+ "ijv/a if |tojv| < y/a/log(a), 

N 1 1 otherwise, 

with ktv = t(0jv) and = m(9^). 

Theorem 4. Lei h be a bounded junction. Suppose that p(9) is almost differentiable 
with respect to 9\ and 2 . J/ 0) fto/ds wzt/i 9=1 and £ G So, i/ien 

E t {h{Z<®)}=* 1 h + o(l/a). 

The proof is in Appendix A. 3. Theorem [4] shows that under mild conditions 
zffl is approximately standard normal to order o(l/a) in the very weak sense of 
Woodroofe [23]. It extends Theorem 14 of Weng and Woodroofe 17 1 by not requiring 
h to be symmetric and not assuming V 2 £ to be continuous. 

So, an asymptotic level 1 — a confidence interval for 9\ is 

(13) l N = {6 1 :\zP\<z a/2 }, 

where z Q / 2 is the 100(a/2)-th percentile of the standard normal distribution. 
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2.2. The bivariate normal model with known covariance matrix 

Suppose that Xj — {X\j,X%j)' for j — 1, . . . ,n are independent random vectors 
from a bivariate normal distribution with mean vector 9 = (6±, 62)' and covariance 
matrix 

yyai a 2 of 

Let tp = (of, of, 7)'. As before, let N = N a be the stopping time depending on a. 
Then,since the likelihood function is not affected by the use of a stopping time 
(e.g. [3]), the density of Xjm is 



p(x; 9, ip) = exp 

2a 2 a 2 (l- 7 2 ) 



Mog(27r)-^logKa 2 2 (l- 7 2 )} 



N N 



( 14 ) - 9 ^ 2n „^ { 4 E^w ~ 0i ) 2 + ff i E(^ - 

.7=1 J'=l 
JV 

- 270-1(72 E( x ij ~ ^Ofcaj - 
i=i 

If we assume that is unknown and ^ is known, then this model is a two-parameter 
exponential family with density that satisfies 

logp(a;; 9) = c{x) + N9 1 t 1 + N9 2 t 2 - Nb{9), 

where h = xi/{ct 2 (1 - 7 2 )} - "/x 2 /{aicT 2 (l ~ I 2 )}, t% = x 2 /{aj(l - 7 2 )} - 7x1/ 
{0-102(1 - 7 2 )} and b{9) = 9'Y>- 1 9/2. Since b{9) is quadratic in 9, both i x {9) and 
i 2 {9) defined in Section 2.1 are constants; and therefore k{9) in © reduces to 

ncn (-&02, fell) -Vp(g) 

(15) k(9) = — = -axpw, 

^02*1 

where ptj = d l+J p/ d9\d9 3 2 and the second equality in (fTSf follows since the stopping 
time N is assumed to depend only on X\\, . . . , X\n- Simple calculations show that 
the maximum likelihood estimator of 9 is (9i,9 2 ) = {Xni, Xn2) and that the 
restricted maximum likelihood estimator of 9 2 given 9i is 9 2 = ^2(^1) = $2 — 
^a 2 (9i — 9\)j<7\. By ([T|) and ©, it is straightforward to obtain 

(Z N1 , Z N2 ) = (^Na^ 1 (9 1 - ft), V^ffJ^l - 7 2 )~ 1/2 {^2 - & - 7^(#i - ^i)M})- 

Furthermore, since the stopping time depends only on the first population, it can 
be shown that m(9) in satisfies 

m{9) = k 2 (9) = {a lPw ) 2 . 

Then, substituting these Z^ii k and m into (JTDJ) , (TTT|) . and p2[) . by Theorem 01 
the approximate level I — a confidence interval for 9\ is as in (|13D . 

For inference about the secondary parameter 9 2 , it is not appropriate to use Z^ 2 
as it depends on both 9\ and 92- So, we consider the transformation 

(16) Z N1 = Z N1 (9) = sj 2{L N (9 N1 J N2 ) - L N (9 N1 ,9 2 )}sign{9 2 - 9 N2 ), 
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where 9ni = #/vi(#2) is the restricted maximum likelihood estimator of Q\ given 
9 2 . Then Zjvi = VN(9 2 — 9 2 )ja 2 . To obtain the mean correction term, we need to 
replace by and pij in (fT5|) with bji and pji. So, 



(-620,611) , ; ; 

(17) E e (Z N1 ) ~ = -p p^-V) = —r^Pio. 

V 5 V 5 6 20 (6o2 - fc) 1 / 2 

Using a similar trick, we obtain 

(18) m(0) = K 2 (0) = (a^pio) 2 . 

With this Z/vi and its corresponding mean and variance corrections, we obtain a 
renormalised pivot zffl as in (|10p . Then, by Theorem [4J an asymptotic level 1 — a 
confidence interval for 9 2 is 

(19) 6 N2 + -j=(j,y ±-=T' N 'z a/2 . 



This interval is of the same form as the one obtained by Whitehead, Todd and Hall 



21]. However, they use recursive numerical integration to calculate the correction 



terms instead of asymptotic approximations. 

3. Extension to unknown covariance matrix case 

In this section, we consider the following three cases: 

CI. <Ti and a 2 are known, but 7 is unknown; 
C2. <7i and a 2 are unknown, but 7 is known; 
C3. (Ji, a 2 and 7 are all unknown. 

When the parameters are unknown, we estimate them by erf — Ej=i(-^y — 9i) 2 / 
(N- 1) for i = 1,2 and 

Ef=i(*ii-0i)(^-0 2 ) 

7 = 



Ef=i(^-^) 2 Ef =1 (^-^) 2 

As the main interest of this paper concerns inference about the secondary pa- 
rameter # 2 , in the rest of the paper we let Zmi be as in (fT6|) . So the corresponding 
re(<7i,7, pio) and m(<7i, 7, pio) are as in (TTT|) and (fig)) . For cases C1-C3, we consider 
Kff = «;((7i,7,pio), = k(<7i,7,Pio) and k$ = k(cti, 7, pio), respectively; and 
correspondingly define /ij^ and rffl for fc = 1, 2, 3 as in (fTTj) and (fT2|) . Then, let 

(20) 4 1} = 



r 



N 



and 



r91 \ y(fe) _ gjVlCgg) ~ Ajyj 
( - J -' Z W - ITfe) 



T 



for fc = 2, 3, where Z N1 (& 2 ) = Vn(0 2 - 2 )/a 2 . We will use Z$ for k = 1,2, 3 as 
pivotal quantities for cases CI, C2 and C3, respectively. 
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Define tjjy = of/* 7 !' Then we can rewrite zffi for k = 2, 3 in (f2Tj) as 



(22) 



7 0) 

J AT 



(ft)^l 



f (fe) 



7 r ,(fc) A i/2 

^Wl ~ Mat ^Af 
. 1/2 . (ft) 
JV T AT 



In the rest of the paper, let S denote the collection of all prior densities 9) = 
£,i(^)£,2(9) with compact support in (0,oo) 2 x (—1,1) x Q for which £ is twice 
differentiable almost everywhere under and V 2 £ is bounded on its support. 

Theorem 5. Suppose that £ £ S and i/iai (0) ZioWs wii/i g = 1. Then, for k — 2,3, 



(23) 



/ / 

4, 



E^ 9 {h(Z^)} - &h - ^ A h)p 2 (9) 



£(ip,d)dddip 



o 



/or all bounded functions h. 

The definition of $4 and the proof are in Appendix A. 4. Theorem [5] shows that 
Zjfi' for k — 2, 3 are asymptotically distributed according to a t distribution with N 
degrees of freedom to order o(l/a) in the very weak sense, since § 1 h+($4h)p 2 (9)/a 
represents the first two terms in an Edgeworth-type expansion for the t distribution 
(e.g. [H, Chap.2; [§], Chap.2). Hence, 



(24) 



P*fi{\ z P\ < z } = ZG N (z) - 1 + o(l/o) 



very weakly, where Gn denotes the t distribution with N degrees of freedom. So, 
an asymptotic level 1 — a confidence interval for 62 is 

~jjj^ N VA Tjv CN - al2 ' 

where CN, a /2 is the 100(a/2)-th percentile of the t distribution with N degrees of 
freedom. Note that the form of the above interval is similar to one obtained by 
Keener flpj ] using fixed 9 expansions. However, his interval is only valid up to order 
o(l/ s/a) and only applicable to linear stopping boundaries. 

The proof of Theorem [5] reveals that the correction term (<£> 4/1) p 2 (9) / 'a in (f2"3"| 

arises from the use of ujn- Since a 2 is known for zff in (|20[) . this correction term 



vanishes in the asymptotic expansion for Z N 
orem[5]is the following result. 



and an immediate corollary to Thc- 
Corollary 6. Suppose that (eS and that ^ holds with q = 1. Then 

E^ e {h(Z$)} - ^h}^, 9)d9d^j 



I I 

Jfo.ooPxf-i.i) Jn 



a 



'(o,oo) 2 x(-i,i) Jn 

for all bounded functions h. 

Therefore, zff is asymptotically standard normal to order o(l/a) in the very 
weak sense, and consequently 

P^A\Z [ n\ <z}=2&(z)-l 



o(l/a) 



very weakly. From this, one can set confidence intervals for 9i as in (I19|) . but with 



pffl and f^ 1 replaced by p}^' and f 



>(o) 



(i) 



' N ■ 
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4. Simulation results 
4-1. General 

(k) 

Section 3 considers asymptotic results for Z N for a class of stopping times N — 
N a depending only on the first population. Specifically, let q denote a measurable 
function on 5i which is almost differentiable; let 6 n i — Y^j—i Xij /n and 

(25) N = N a = inf{n > m : nq(6 nl ) > a} A to, 

where too and m denote the initial sample size and the maximum size, respectively; 
mo = L a / e oJ an d m = L a / e2 J' l x \ i s the greatest integer less than or equal to 
x, a > 1 is a boundary parameter, e a truncation parameter and eo controls the 
initial sample size. In this section, we assess the accuracy of the method for two 
simulated examples, the truncated sequential probability ratio test and the repeated 
significance test. The actual coverage probability and expected stopping time are 
assessed through simulation for <j\ = u% = 1 and selected values of (61,62, 7). 



4-2. Truncated sequential probability ratio test 

The stopping time (|25[) with q(y) = \y\ is equivalent to 

N = inf{n > mo : \S n i\ > a} A m, 

where S n i = Y^j=i Xij is the partial sum from the first population. This is the 
truncated probability ratio test depending on three parameters, a > 1, eo and 
e. Simple calculations yield a/N — > p 2 , where p = max{min(eo, \Zl^il)> e l- The 
parameter values are taken as a = 10, e = y/0.1 and eo = \/5.0. So, mo = a/^o = 2 
and to = a/e 2 = 100. Tables 1 and 2 contain results for known o\ and 02, but 
unknown 7, that is, case CI, and for unknown eri, 01 and 7, that is, case C3, 
respectively. 

In Table 1, we report the expected sample size and the lower and upper 0.05, 
0.025 noncoverage probabilities for Zni and Z^K The results show that zff is very 

accurate for all selected parameter values, but Z^\ is negatively skewed. Table 2 

(3) 

compares the coverage probabilities using Zn\ and Z N . The coverage probabilities 
for PTjjfi(\Zpfi\ < z a / 2 ) for a = 0.05 and 0.1 are in the columns with the title 

'^jvi'- The results using (j24f for the pivotal quantity zff are given under the title 
l zff : <at'. As (f2"3")l suggests that zff can be approximated by a t distribution with 



Table 1 

Truncated sequential probability ratio test with known o\ and <T2, but unknown 7; 
replicates = 10,000 (± means 1.96 standard deviations) 



Zni Z 



(01, 02, 7) 


B*,«(JV) 


L0.05 


U0.05 


L0.025 


U0.025 


L0.05 


U0.05 


L0.025 


U0.025 


(0.30, 1.00, 0.40) 


35.42 


0.059 


0.040 


0.031 


0.019 


0.048 


0.049 


0.025 


0.025 


(0.60, 1.00, 0.40) 


17.87 


0.057 


0.039 


0.028 


0.020 


0.048 


0.048 


0.024 


0.025 


(0.80, 1.00, 0.40) 


13.54 


0.056 


0.043 


0.027 


0.022 


0.048 


0.050 


0.024 


0.025 


(0.30, 1.00, 0.80) 


35.20 


0.070 


0.030 


0.036 


0.017 


0.050 


0.050 


0.023 


0.024 


(0.60, 1.00, 0.80) 


17.87 


0.064 


0.036 


0.034 


0.017 


0.049 


0.052 


0.025 


0.026 


(0.80, 1.00, 0.80) 


13.55 


0.058 


0.040 


0.029 


0.019 


0.046 


0.055 


0.023 


0.027 


± 




0.004 


0.004 


0.003 


0.003 


0.004 


0.004 


0.003 


0.003 
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Table 2 

Truncated sequential probability ratio test with unknown o\, 02 and 7; replicates = 10,000 
(± means 1.96 standard deviations) 













: tff 


Z N 


; *a/P 2 


(01,02,7) 




90% 


95% 


90% 


95% 


90% 


95% 


(0.30, 1.00, 0.40) 


35.42 


0.885 


0.934 


0.892 


0.944 


0.896 


0.947 


(0.60, 1.00, 0.40) 


17.87 


0.871 


0.923 


0.884 


0.941 


0.892 


0.947 


(0.80, 1.00, 0.40) 


13.54 


0.863 


0.917 


0.885 


0.936 


0.895 


0.945 


(0.30, 1.00, 0.80) 


35.20 


0.877 


0.929 


0.891 


0.944 


0.896 


0.947 


(0.60, 1.00, 0.80) 


17.87 


0.865 


0.918 


0.879 


0.936 


0.888 


0.942 


(0.80, 1.00, 0.80) 


13.55 


0.859 


0.911 


0.878 


0.935 


0.888 


0.944 


± 




0.006 


0.004 


0.006 


0.004 


0.006 


0.004 



Table 3 

Repeated significance test with known o\ and <T2, but unknown 7; replicates = 10,000 (± means 

1.96 standard deviations) 



Zjvi Z 



(01,02,7) 




L0.05 


U0.05 


L0.025 


U0.025 


L0.05 


U0.05 


L0.025 


U0.025 


(0.30, 1.00, 0.40) 


75.18 


0.064 


0.045 


0.034 


0.023 


0.052 


0.047 


0.026 


0.024 


(0.60, 1.00, 0.40) 


27.53 


0.061 


0.037 


0.031 


0.018 


0.047 


0.045 


0.023 


0.023 


(0.80, 1.00, 0.40) 


16.16 


0.060 


0.041 


0.032 


0.019 


0.050 


0.052 


0.024 


0.025 


(0.30, 1.00, 0.80) 


74.88 


0.093 


0.047 


0.049 


0.024 


0.052 


0.047 


0.025 


0.024 


(0.60, 1.00, 0.80) 


27.26 


0.083 


0.029 


0.041 


0.014 


0.051 


0.044 


0.025 


0.023 


(0.80, 1.00, 0.80) 


16.20 


0.067 


0.030 


0.032 


0.016 


0.049 


0.047 


0.025 


0.024 


± 




0.004 


0.004 


0.003 


0.003 


0.004 


0.004 


0.003 


0.003 



a/p 2 degrees of freedom, we give the results in the last two columns l Z N : t a jpi\ 
Apparently, the coverage probabilities for the naive statistic are all significantly 
less than the nominal values. The results using a/p 2 degrees of freedom are slightly 

(3) 

better than those with N. The distribution of Z N shows no appreciable skewness. 

4-3. Repeated significance test 

The stopping time (|25[) with q(y) = y 2 is equivalent to 

N = mf{n > mo : \S n i\ > \fna] A m. 

This is the repeated significance test depending on three parameters, a > 1, eo and 
e. It is easily seen that a/N — > p 2 , where p = max{min(eo, \6i\), e}- We take a = 10, 
e = VO.l and eo = \/2.0. So, mo = a/e§ = 5 and m = a/e 2 = 100. Tables 3 and 
4 contain results for known a± and 02, but unknown 7, that is, case CI, and for 
unknown <j\, ai and 7, that is, case C3, respectively. 

In Table 3, we see that Z^i is slightly more negatively skewed than in Table 1, 
but is again very accurate for all selected parameter values. The coverage 
probabilities in Table 4 show that the use of Zmi leads to significantly lower coverage 
probabilities than the nominal values, but using zff and a t distribution with a/p 2 
degrees of freedom also works very well for this test. As before, the distribution of 
zff shows no appreciable skewness. 

5. A practical example 

In this section, we illustrate the proposed confidence interval method using the data 
obtained by Bellissant et al. 0. This study was concerned with the treatment of 
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Table 4 

Repeated significance test with unknown a\, 02 and 7; replicates = 10,000 (zfc means 

1.96 standard deviations) 















7 (3 

N 


; *a/P 2 


(01,02,7) 




90% 


95% 


90% 


95% 


90% 


95% 


(0.30, 1.00, 0.40) 


75.18 


0.880 


0.934 


0.897 


0.947 


0.900 


0.948 


(0.60, 1.00, 0.40) 


27.53 


0.872 


0.925 


0.891 


0.939 


0.896 


0.946 


(0.80, 1.00, 0.40) 


16.16 


0.854 


0.907 


0.875 


0.933 


0.886 


0.942 


(0.30, 1.00, 0.80) 


74.88 


0.847 


0.911 


0.891 


0.945 


0.896 


0.948 


(0.60, 1.00, 0.80) 


27.26 


0.850 


0.908 


0.883 


0.938 


0.893 


0.945 


(0.80, 1.00, 0.80) 


16.20 


0.850 


0.904 


0.876 


0.934 


0.887 


0.945 


± 




0.006 


0.004 


0.006 


0.004 


0.006 


0.004 



infants of up to eight years of age suffering from gastroesophageal reflux. The in- 
fants were randomised between metoclopramide and placebo, which they received 
for a two-week period. The pH level in the oesophagus was measured continuously 
using a flexible electrode secured above the lower oesophageal sphincter. The pri- 
mary response variable was the percentage reduction in acidity, measured by the 
proportion of time that pH < 4, over the two weeks of treatment. 

The above variable was taken to be normally distributed and the triangular test 
([20I], Chap. 4) was used to monitor the study Inspections were made after groups 
of about four patients and the trial was stopped after the seventh interim analy- 
sis, with the conclusion that metoclopramide is not an improvement over placebo. 
Although Bellissant et al. Q mention various normally distributed secondary re- 
sponse variables of interest, only standard analyses of them are carried out. For 
example, uncorrected confidence intervals are given for secondary parameters of in- 
terest. Thus, it is interesting to apply the corrected confidence intervals presented 
in Section 3 in this case. 

In order to illustrate the confidence interval method, we assume that there is a 
single secondary response variable, the proportion of time that pH < 4 on day 14, 
and that the patients arrive in pairs, with one patient in each pair being assigned 
to metoclopramide and the other to placebo. The trial data give the estimates 
81 = 0.3, 9 2 = 0.07, <7i = 0.5 and 02 = 0.1. To simulate the trial, we treated these 
values as the true values for the parameters. Further, since the sample covariance 
matrix was not available, we simulated the trial when 7 = 0.4 and 7 = 0.8, as for 
the two sequential tests in Section 4. As in the original trial of Bellissant et al. [2|, 
we use a one-sided triangular test to test Ho : 9\ = against Hi : 0% > and 
choose the design parameters so that it has significance level 5% and 95% power 
for 6>i = 0.5. 

Let m a denote the group size, possibly depending on a > 0. Then the stopping 
time for the above triangular test is essentially of the form 

N = inf{ri > 1 : m a \n, S n i/ai > a + bn — 0.583 or S n i/ai < — a + 3bn + 0.583}, 

where m a \n means that m a divides n and S n \ denotes the sum of the first n 
differences in response between metoclopramide and placebo. Values are chosen 
for the parameters a > and b > in order to satisfy the error probability re- 
quirements, and the number 0.583 is a correction for overshoot of the stopping 
boundaries due to the discreteness of the inspection process ([20j, Chap. 4). Upon 
termination of the test, Hq is rejected if Sni/&i > a + bN — 0.583 and accepted if 
Sjvi/cti < —a + 3WV + 0.583. Now, the above stopping time may be rewritten as 

(26) N — infjrT, > 1 : m a \n and nq(6 nl /ai) > a - 0.583}, 
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Table 5 

Triangular test with unknown o\, 02 and 7; replicates = 10,000 (± means 1.96 standard 

deviations) 











1 




: tjv 




: *a/fl 2 


(01, 02, 7) 


Power 




90% 


95% 


90% 


95% 


90% 


95% 


(0.00, 0.07, 0.40) 


0.021 


7.43 


0.807 


0.864 


0.848 


0.921 


0.892 


0.935 


(0.00, 0.07, 0.80) 


0.021 


7.43 


0.815 


0.867 


0.857 


0.919 


0.896 


0.936 


(0.30, 0.07, 0.40) 


0.574 


10.49 


0.826 


0.885 


0.866 


0.927 


0.894 


0.949 


(0.30, 0.07, 0.80) 


0.574 


10.49 


0.780 


0.849 


0.860 


0.921 


0.892 


0.956 


(0.50, 0.07, 0.40) 


0.956 


8.17 


0.818 


0.877 


0.860 


0.926 


0.893 


0.942 


(0.50, 0.07, 0.80) 


0.956 


8.17 


0.812 


0.867 


0.859 


0.923 


0.896 


0.945 


± 






0.006 


0.004 


0.006 


0.004 


0.006 


0.004 



where q(y) = max(y — b, 36 — y). Note that (|26f is a special case of more general 
stopping times studied by, for example, Morgan [l3j |. So we have a/N — > p 2 , where 
p = max(y / 0i/(7i — b, ^/36 — 6i/<ii), provided that m a — o(a). As in Bellissant et 
al. Q, we take a = 5.495 and b — 0.2726. These values may be obtained using 
PEST 4 (BJ. Since the data are being monitored after groups of four patients, we 
have m a = 2. 

In Table 5, we report the probabilities of rejecting Hq, that is, the power, the 
expected numbers of pairs of patients, and the coverage probabilities using Zmi and 
Z N , all of the results being based on 10,000 replications. Although the simulated 
sequential test satisfies the power requirement for 0\ — 0.5, it is a little conservative. 
This is because the above stopping time is not exactly the same as the original. 
Now, we know from Section 4 that the confidence intervals based on Zni have 

(3) 

coverage probabilities below the nominal values and that those based on Z N have 
roughly the correct coverage probabilities. The results in Table 5 show that the use 
of Z y N ' leads to coverage probabilities which are usually quite close to the nominal 
values, especially given the small sample sizes. Note that, since our theory has been 
developed for the case where p = p(0i), when calculating the correction terms, o\ 
has been replaced with its estimate except in terms involving p, when its true value 
is used. We return to this point in Section 6. 

Returning to the actual trial, a standard analysis gives an uncorrected confidence 
interval for 62 of (0.018, 0.122), whereas the corrected confidence interval is (0.008, 
0.124) when 7 = 0.4 and (0.002, 0.122) when 7 = 0.8. So the approach is useful in 
practice, especially if the correlation coefficient is large. 

6. Discussion 

In this paper, we have shown how corrected confidence intervals for secondary 
parameters may be constructed following a sequential test in which one component 
of a bivariate normal process is being monitored. The intervals have a simple form 
and very weak expansions are used to justify them. Simulation of two well-known 
sequential tests show that the approximations are very accurate. We have also 
illustrated the approach using a real-life example. 

We have only considered sequential tests based on the mean of the first compo- 
nent of a bivariate normal process. As we have seen in Section 5, a sequential test 
may also depend on the variance of the first component, so that p — p{9%, <J\). The 
derivation of the variance correction term is more complicated in this case, since 
the sampling variation in a\ needs to be allowed for. For some related work in this 
direction, see [26j . 
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There may be several primary response variables in practice. So a natural ex- 
tension would be to generalise the ideas in Sections 2 and 3 to a p-variate normal 
process, where p > 2. Such a development would require consideration of the mul- 
tivariate version of the signed root transformation and an application of the results 
of Weng and Woodroofe (l7j for the p-parameter exponential family in order to 
determine the analogues of the mean and variance correction terms in (17) and 
(18). 

Although we have considered both the known and unknown covariance matrix 
cases in this paper, one assumption that we have made is that the correlation coeffi- 
cient between the two components of the response vector is constant over time. This 
is called the proportionality case by Hall and Yakir 9] . Another natural extension 
would be to generalise the ideas in Sections 2 and 3 to the non-proportional case 
where the correlation coefficient is a function of time. 

A further possible extension is to consider two binary streams of data, where 
the primary parameter is the log odds ratio and the secondary parameters are the 
individual success probabilities. Although approximations may be obtained using 
the results of Weng and Woodroofe [17|, they do not lead to simple formulae. 
However, it would be interesting to compare this approach with that of Todd and 
Whitehead [ltjj . and also to consider unequal sample sizes. 



Appendix A 

A. 1. W aid-type equations for bivariate normal models 



In this subsection, we provide some results on randomly stopped sums for the 
bivariate normal models. Recall the definitions of 7 and of, i = 1,2, in Section 3. 
Now define of = (N - l)6f /N, i = 1,2. So, 



( 27 J a i - a i - °i - \Pi - Vi) j 



and 7 defined in Section 3 can be rewritten as 

, os , . Ef=i(^-^i)(^-^ 2 ) 

(28) 7 = — '— 



N&X&2 

Let Lin denote the likelihood function based on the first population and let L' 1N 
denote the partial derivative of Luv with respect to a\, so that 



N N 
L 1N = exp { -— log(2vr) - — log(af ) - 



-i)-iE(^-^) 2 l 

1 i=i J 



and 



d r 1 A, „ N 



(29) L> N = ^L 1N = |^E(^ - 0,? 



N- 



Then we also have 

A(2afL' 1Ar ) 1 f N 
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Let p be as in (fT4|). Some of the derivations in Lemma[7] below rely on the identity 
(e.g. 0, Chap.l) 



(81 ) -^E^. e (M N ) = I M N f-^ logp ) dP^g 



for i = 1,2, where 



d , N 



JV 1 U,"r^ 



2 JV 

2 



9a2 2o 2 2o 2 o 2 (l- 7 2 ) 



or — 

1 3=1 



N 

2 



and dlogp/da^ has a similar form. 

Lemma 7. Suppose that (eH, Mjv = Mjy(Xn, . . . , Xljv, X 2 i, . . . , X2jv), 0) 
is twice differentiable and V 2 6 is bounded. Then the following hold: 

(i) Et{b(i), 9)M N } = £^{6(^ 0)}£ £ -(Mtf), wftere f - $6/^(6) G S; 



(ii) ^{Ef=i(^ ~ *<) W = V * = i. 2; 



1,2: 



(iii) ^[{E;=i(^ ~ Oi?Y/N] = ^(2o? + ofiV) /or z : 

(iv) £f{£jLi(*« ~ ^i) 2 EjLiPGy - 2 )W - ^(27 2 ^ 2 o 2 2 + ofofiV). 

Proof. The proof of (i) is straightforward and hence omitted. Consider (ii) . Taking 
Mat = 1/N in IJ5TJ) for i = 1, 2 leads to 

7-^,0 S — ^ f = -o-i o-2(l - 7 ) 

o 2 / Ef = i(^u- i) 2 l 

+ oT^' e j N / 

~2o3a 2 (l- 7 2 )^2^,e(^) 

and a similar equation with oi and 02 switched, where dE,p j g(l/N)/daf — 0, since 
N defined in (|25|) does not depend on Ui\ and therefore, 

(32) E,, = 1 ^ fk^_J . 



By (22J, 



Ej=i(^ij - 0i) 2 1 „ 4 L'uf . 2 



. ^-x, -„ ~_ , 2o?^^ + o 

I iV I H 'A^iw 
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where 
E ( (2*1 



Tl 

4 



NL 



IN 



4of£(of , Or) + 2a 2 -^(a 2 A) \ E au9l (-^d^dfli 



1 



■N' 



4a 2 1 ^a 2 1 ,6 1 )E (TlM (^)dald9 1 

where the third line follows by an integration by parts and the fact that £ is defined 
on a compact set and vanishes on the boundaries, and the last line is zero since 



E auei (l/N) does not depend on a x . So, ^{E^iC^i? - Oif/N} = E^af), and 



then by p2j) , we have £ ? {EliP^ - <9 2 ) 2 /^} = E $ (a. 



Now consider (hi) and (iv). First, by (j3"0)) . 



N 



J IN 



N 



NL 1N 



2a i l +2alY J (Xi J -e 1 ) 2 -Nat 
3=1 



where i?^{(8CT^i' 1JV + 4afL'^ N )/(NLusr)} = can be proved in the same way as 



in the preceding paragraph, and Ej=i(^ij — #i) 2 } = E^afN) by Wald's 

equation. Hence, 

(33) 



E f 



Next, taking M N = Efei (^lj ~ 0i) 2 /N in (EH) for i = 1, 2 leads to 



Sf =1 (^--gi) 2 
JV 



a^ 2 



(34) 



1 \ N 2I 



1 



2of(l -7 2 ) 
7 



{EL(^y-W 



2\2 



iV 



2<T?(7 2 (l-7 2 ) 



E?=i(*u - 0i) 2 E?=i{(*u - - #2)} 



-,2 V^W 



2V 



and a similar equation. Taking = E*Li(^2j — &2) 2 /N in (f3Tjl for i = 1, 2 leads 
to two further equations. By (ii), (f33l) . (f34l) and the other three equations, we obtain 
(iii) and (iv). □ 
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A. 2. Biases and variances of estimators 

We now give some properties of the estimators for the bivariate normal models. 
Similar calculations are carried out by Coad and Woodroofe Q for an adaptive 
normal linear model. 

Lemma 8. Suppose that (eS and that <j6j) holds with q = 1. Then the following 
hold: 

(i) E € 0i - 6i) = o(l/Vo) for i = 1, 2; 

(ii) £ € (£f-af)= (l/a)/ H = l,2. 

Proof. Consider (i). First, by (|6|), — 6*i)l{a/Ar a >i/r)}] = o(l/a). Then, observe 

that 

V^i " 6i)ha/N a <l/ V } = \[^ Nl/2 ^ - ^{a/N^l/,} => N(0, ^p 2 ). 

Since a/N a < 1/n and N x / 2 {9i — 6*^) is uniformly integrable by Lemma[3J the left 
side converges in the mean. Hence (i) follows. 

Next, consider (ii). From (J2TJ) and LemmaEf ii) , aE^af-a}) = -aE^{(§i-0i} 2 }. 
Then, since of — of = of — erf + of /(AT — 1), we have 

aEttf - o 2 ) = -al%{$ - ^) 2 } + S £ (^5f ) + o(l), 

where a(9i — 9i) 2 =*> p 2 <rf Xi an d is uniformly integrable, by a similar argument to 
the preceding paragraph, and hence converges in the mean, and the second term is 
E^(p 2 af) + o(l). So the result follows. □ 

A simple consequence of Lemma HJii) is 

(35) E s {uj n - 1) = o(l/a). 

The derivation of Lemma Elm) (iv) below relies on the identity 

(36) ^E^.g{M N ) = J M N (J^logp) dP^.g, 



where p is as in (|14| and 

9, tv 7 (i + 7 2 ) Ef=i(^y - Oi)(x 2j - e 2 ) 

— l0gp= 2 + 2^2^ 



7 



S-Li(^ij-gi) 2 , E- v = i(^-e 2 ) 2 



(1 - 7 2 ) 2 \ o\ o~2 j 

N 7 (1 + 7 2 ) |Wj , A^i - 0i)(0 2 - 2 ) 



1 — 7 2 (1 — 7 2 ) 2 1 O1O2 O1O2 j 

7 J iNffi+iVCei-fli) 2 A^of + JV(0 2 - e 2 f 



(l-7 2 ) 2 [ <*? 
Lemma 9. Suppose that £ € S. T/ien t/ie following hold. 
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(i) E^Niaf - of) 2 } = 2E^af) + o(l) for i = 1, 2; 

(ii) Et{N(*f - a 2 )(a 2 - of)} = 2^( 7 2 o 2 o 2 2 ) + o(l); 

(iii) E^ - 7) = -E f { 7 (1 - 7 2 )/(2iV)} + o(l/o); 

(iv) ^{( 7 -7) 2 }=o(l/^)- 

Proof. For (i), first from Wald's equation, 



JV 



(37) = E^ e { YX x *i - ~ N(J i 

Next, by (f27|) we can write 
Ntf - o~ 2 ) 2 



E^ fi {N{d\ - of) + N(0i - 9i) 2 }. 



TV 



2iV(of - of ){of + (0* - 9 t ) 2 } - iV{of + & - 9if} 2 



where £ £ {iV(o 2 - of)(0j - tf^ 2 } and E 6 {N(9i - 9 t ) 4 } are both o(l). Then, together 
with Lemma [7J iii) and ([57]) . we obtain the desired result. 
For (ii), by (HZ]), 

JV N N 

iV(o 2 - o-\)(*l a\) = 1 ^(X, - ^) 2 " ~ ^E<^ " ^ 



N 



3 = 1 



3=1 



N 



-a 2 2 J2( X i3 - Oi? - 0i 9 1 ) 2 £(X 2j - ^ 2 ) 2 



j'=i 



JV 



-(4 - 2 ) 2 £(*« - °if + N (di - °i) 2 (°2 - e 2 f 

3 = 1 

+N<7 2 {§ 2 - 6 2 ) 2 + N<40i - #i) 2 + Na\a\, 



where we have £ C {£f =1 (Jfy - 0i) 2 Ef=i(^ - # 2 ) 2 /iV} = £ ? (2 7 2 o 2 o 2 + o 2 o 2 7V) 



by Lemma^iv), E^a 2 Y^ =l {X 2j - 9 2 ) 2 + a 2 E- =1 (*U ■- #i) 2 } = 2£ f (o 2 o 2 iV) by 
Wald's equation, E^{N(9i — 9i) 2 (9 2 — 9 2 ) 2 } — o(l) because the integrand approaches 
zero and is uniformly integrable, and 



E f 



N 



E, 



-9 2 ) 2 ^Na 2 1 -J2(X lj -9 1 ) 

N 

N(9 2 - 9 2 ) 2 J2{(Xi ] -9 1 ) 2 -a 2 }/N 



3 = 1 



0(1) 



because N(9 2 - 9 2 ) 2 = O p {\) and Y£=i{{ x ij - 6 i) 2 - ^i}/ N = o p (l) are both 
uniformly square integrable. Similarly, I%[(#i — 6*i) 2 {7Vcr 2 — X)i=i(^2j — 9 2 ) 2 }] = 
o(l). Hence (ii) follows. 

Consider (iii). Taking Mjv = l/iNaxai) m (|3^|) . and then multiplying both sides 
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by (1 — 7 2 ) 2 leads to 



(38) 



1 



N&X&2 



E 



7(1-7') 



2 J 1 



(1 + 7 2 ) 



1 + 



/ \-o 1 )(6 2 -e 2 )' 



&1&2 I &1&2 

7 fa 2 + (0 1 -0 1 ) 2 l 



a 2 v 



(T1CT2 



From (|28p. the distribution of 7 does not depend on the values of u\ and o 2 - So, 
without loss of generality, we take a\ = a 2 = 1 in the evaluation of £^0(7). Letting 
V>o = (1) 1,7)', then together with (|3"5j) we have 



_ (1-7 2 ) 2 a 



I+7 2 d^ ' 6 1 + 7 



/ 1 



7(1 -7 2 ) 



01(72 



(§!- 6^(62-62) 



C1C2 



1 + 7 2 



Now we claim that 
(39) 



9 



0102 



1 



(9 7 V N<J\CT2 



1 + 7 

h - e 2 f 

d\&2 



a 



2 i'o,- 



a 2 1 



(40) 



E f 



1 



0-10- 2 



1 + \{\EdP 2 ) + \E t {p 2 l 2 °l°t)} + o(h 



i 41 . ^ ( Jl + j£ ) = 2 + i{^(p 2 ) - W 7 V 2 a 2 )} + o(i), 

o"2 01 / a ^ a 



(42) 
and 
(43) 



? 2 - e 2 f I 2 



010 2 



0102 



(e 1 -e 1 )(e2-e 2 ) 



0102 



: ^e(7P 2 



Since the verifications of (|39|) (|43|) are similar, here we only sketch the proof for 

T 2 - 2 



40]) . A Taylor series expansion about the point 2 o = a 20 = 1 gives 



^--l-J(- 2 -l)-J(^ 2 2 -l) + |(- 2 -l) 2 

0102 ^ 
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Then, by (i) and (ii), we obtain (|40|) . 

For (iv), we take M N = l/{Nd l d 2 ) 2 in ([36]), multiply both sides by (1 - 7 2 ) 2 , 
and then take the derivative of both sides with respect to 7. We obtain 




L)(02 



JV^) 2 



O-f 



where E^(I a ) = 0(l/a) and the last equality follows from tedious calculations, 
which are omitted here. Since 



£ 'I" 



(l-7 2 ) 2 ^, 
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(JVcti*^ 2 



the required result follows. 



□ 



Note that, in the absence of a stopping time, Lemma Etui) reduces to the usual 
bias formula for the sample correlation coefficient in the fixed-sample case {e.g. 14j . 
Chap.5). 



Lemma 10. Suppose that £ e S. TTien VJV(wjv — 1) => JV(0,2). Moreover, N(ujn- 
l) 2 is uniformly integrable with respect to P^. 

Proof. The first statement follows since 



iV, 



Vn(lj n - 1) = —3- (of - 0-3) 



2.2 

2 



(JV — 1)ct| 



JV 



JV(0 2 



3) 2 



JV — 1 



where the first term on the right-hand side converges in distribution to JV(0, 2) by 
Anscombe's theorem and the last two terms are o p (l). 

For the second statement, it suffices to show that N(ujn — l) 2 converges to 2% 2 
in the mean. From Lemma [3Ji) and the relationship between &2 and 172 , we have 
E^{N(u N - l) 2 } -> 2. So the result follows. □ 

Two additional results are needed for Lemma fTTI By ([6]) with q > 1/2 and 
Lemma [9ji), we have 



(44) E^aiaf - aiYl {Na>na} ] 



JV 



N(ai - af) 2 l {Na>na} =0(1) 
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for i = 1,2, and, by ([37]) , we have 

(45) Ez[a\\6-6\\\ Na>ria} }=Es[^N\\§-6\\\ Na>va} ] =0(1). 

Lemma 11. Let g{tj), 9) be twice continuously differentiable on a compact set K C 
(0, oo) 2 x (—1. 1) x CI. Suppose that £ € 5 and (0) holds with q> 1/2. Then 

Ee[{g(1>,0) - 9(i>,6)}l {Na >na}] = o(l/y/a). 
Proof. By compactness and continuity, there exists C > such that 

\g{^,6) - g$,§) -{$-$,9- §)Vg(i>,6)\ < C(|$ - V>|| 2 + \\6 - 9\\ 2 ). 

Now, since E e [{\\^ - V>|| 2 + \\0 - 6>| | 2 )l {A r a>l;a} ] = o(l/y/a) by Lemma EJiv) , flU 

and gSJ), and ||^(0-0)|| + ||^('0-t/')|| = o(l/y/a) by Lemma Efi)(ii) and Lemma 
(9](iii), the statement follows by using the arguments in Proposition 6.13 of Weng 
and Woodroofe 0. □ 

Note that, if K — ^ 9 i=1 Ki, Ki are compact sets, K° n K° = for i ^ j, where 
K° denotes the interior of Ki, and g is twice piecewise continuously differentiable 
on Ki, then we can write 

9 r 

EdM, 0) - 9$, 0)} = E / M> 0)E*Mil>, 0) - 9(iP, &)}dOdiP 
i=i Jk * 

9 i r 

= i / - E M^,e)-g(i>,e)}, 



2 — 1 



where Ci are normalising constants and £j = c^l^. Thus, Lemma [TT] holds for such 
g. In particular, it applies to k and yields E^[{k^' — K }l{Af a >ija}] = f° r 
fc = 0,1,2,3. 



A. 3. Proo/ of Theorem^ 

Three lemmas are required for the proof. 
Lemma 12. Let h be a bounded function, and let 

H (a,n) = / h{^—^)^{z)dz 
Jst a 

and 

Hi(a, /it) = / z/i( )<p(z)dz 

J3i °~ 

for a > anc? — oo < // < oo. TTien -ffo an d H\ have continuous derivatives of 
all orders. Further, at fi = and a — 1, we have Hq — <& l h, dHo/d^j, — — $ 1 L r /i, 
dHo/do- = -2^Vh, d 2 H Q /d[i 2 = 2$> x Vh, B x = ^Uh, dHi/dp = -2&Vh, 
dE x jda = and d 2 H 1 /dfi 2 = 0. 
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This lemma is a simple extension of Lemma 13 of Weng and Woodroofe and 
can be proved analogously. Note that, if h is symmetric, then ^Uh = 0, and hence 
dHo/dfj, = Hi = 0, and Lemma fT2l reduces to their Lemma 13. 

Now define 



R2, a (h) = a E?{h(Z N )} - $ 2 ft - (^Uh)'E[ {rf (0, 0)} 
\ v jv 



-tr^w^p^r^M)}; 



Then, by Proposition [2j we have 



R2,a( h ) = 



rf(M)} 



-tr^W^^^)}] - tr[&VhE? {ft6)T%(e,B)}] 



Lemma [T31 below is similar to Theorem 7 of Weng and Woodroofe 17J, but here we 
consider i?2,a(^-) for all bounded h, not necessarily symmetric. 

Lemma 13. If (OJ) holds with q = 1 and £ € So, i/ien i«7Ti a _ >00 |^[i?2,a(^)l{jv a > 7 j a }]| 
= /or aZ/ bounded h. 

(2) 

Proof. First, lima^ool^j^.aC 1 )}! = follows by the same argument used to prove 
Theorem 7 of Weng and Woodroofe (l7| . Next, since 



Et:{{<S> 2 Uh)'E? 



-{T\{e,e)-T\{6,e)} 



l{AT a >r )Q }) 



<CiV£EHI|r?(M)-rf(M)l|} 

for some constant Ci, the right-hand side is o(l) by Lemma [TT1 □ 

Proof of Theorem [4j Since h is bounded and both P^(N a < rja) and P^(Bj^) 
are o(l/a), it suffices to show that E^[h(Z ^)^{N a >na}nB N 

} = $ 1 /i + o(l/a). Write 

h(Zx) = h a {Zfi\). Then, by the definition of i?2,o, 
A0)\ 



(46) 



E£{h{Z$>)}=E£{h a (Z N1 )} 

1 



1 



($ 1 7A a )^{p 2 (9)4 11 (9,8)} + -R2.a(ha), 



where £ , ^[i?2,a(^a)l{Ar a >j 7 a}] — > as a — > oo by Lemma [T51 Since h here may not be 
symmetric, by Lemma [T^l two additional terms arise in the analysis of (I46[) . namely, 



A(h) 



V a v •* 

To show that the effect of non-symmetry of h vanishes, observe that 

E ( [A(h)i {Na>na} ] = i^tttj^Kp^rf^M) - ^}i{iv a >w] 

v a 

= i=(*lt7/l)S € [{K(e) - K^}l {JVa >,„}], 
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where the last line is o(l/a) by Lemma [TT1 So, the theorem follows. 
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A. 4- Proof of Theorem{5\ 

Given a measurable function h, s > 0, c > and v e 5R, let 
(47) h*{z) = h{s-ic- 1 (z-u)}, 

V (h; v, s) = -{^Uh)v + {^Vhy 2 - 2($ x Wi)(e - 1) 

-(^y/i)^ - 1) - (* 3 h)v{a - 1) + ^($ 4 ft)(* ~ !) 2 



and 



where 



and 



i/, a) = -2($V/i> + ($ 3 /i)(s - 1), 



$4/l = X { I (z2_1)2 ~5 }/l(2)$1{&} - 
Lemma 14. There is a constant C for which 

\<& l h* - ^h - * (h; u,s)\< C{>| 3 + \s- 1| 3 + |c - 1| 3 / 2 }, 
I^CZ/i* - $ 1 CTh - i/, s)| < C{>| 2 + |s - 1| 2 + |c- 1|} 

and 

I^Wi* - $ x y/i| < C{\v\ + \s -l\ + \c- 1|}, 

for all \v\ < 1, \s- 1| < 1/2, |c- 1| < 1/2 and bounded h. 

We omit the proof of this lemma since it can be derived in a similar manner to 
Lemma 1 of Woodroofe and Coad [25| . 

Proof of Theorem [5j We shall only consider Z N , as the same argument applies 
to From (f2"2"|) , we can write h{zff} — /i*(Zjyi), where Zjvi is defined in (fTHJ) 
and h*(z) is as in ((47]) . As in the proof of Theorem [4] we only need to consider the 
set {N a > na} n Bat. If ij; = (ct 2 ,ct|,7)' is known, the bivariate normal model is 
a two-parameter exponential family. Let -E^ denote the conditional expectation 
given ip and the data by time N. So, by Proposition^ 

Eg'+{h*(Z m )} - &h = Eg+{h*(Z N1 )} - $ l h* + ^h* - &h 
= ±(^Uh*)E^>{p(6)T{ 1 (9,8)} 

1 &Vh*)E^{p 2 (e)rl n (9,9)} 



(48) 



a 

-R2.a{h*{Z N )} + ^h* - &h, 

a 
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where E^ 2 [R2 t a(h*)^{N a >na}] -» as a -> oo by Lemma 1131 Then, by Lemma 1141 
we can write the last two lines of (l48l) as 



■^{^Uh - 2(4> 1 W l )^ /2 A^ ) + (®sh)(u N - l)}E^{p(9)r{ 1 (9, 9)} 

+ - a {^Vh)Eff{p\6)Yi n {B,e)} - (^Uh)^' 2 ^ + (^VhHco'f^r 
- 2(<P 1 Vh){f {2) - 1} - (^Vh)(w N - 1) - {^h)Q:]i 2 ^\u N - 1) 

+ h$ i h){u JN - if + o{-) 

I a 
= -^=(^Uh)I N + -{^VKjllN + -={®zh)III N + (<S> 4 h)IV N + o(-), 



where 



I N = E^{p(9)Tl 1 (6,e)}-u J li 2 k (2 \ 

n N = E^{pi(e)rl n (e,e)} 2^ 2 k^E^{ P (9)r{ 1 (9,9)} + K 2 ^} 2 

-2a{f {2) -l}-a(u N -l), 



and 



m N = Eff[{u> N - i){p{e)v\ tl {e,6) -co'J 2 ^}} 

IV N = ~{u> N - l) 2 . 



To prove (|23| . it suffices to show that E^ [lNl{N a >r)a}] = o(l/\/ffl), that 
£; € [J7jvl{Ar >^ a }] = o(l), that E!=lIII N l {Na>ria} } = o(l/y/a) and that a^[7Viv x 
l{Ar a >r;a}] = E$ {p 2 (9) } + o( 1) . For I N , recall from (|Tj) that we may write E^ 2 {p{9) x 
(#,#)} = E^ 2 {n(ai, 7, pio)}, which together with Lemma fTTI yields 

V / a-Bf[iivl{iV tl >^o}] = ^^[{'t ~ ^7V 2 '«7V ) } 1 {A r a>»)a}] = C^ 1 )' 

Next, consider Jijv- By consistency of &2 and k^\ and (JTSJ), 

£ e [/> 2 (0)rI,n(M) - 2p(9)4 ) u>]l 2 rl 1 (e,d) + {k^ 2 } 2 ] 

= E 6 {p 2 (9)Ti Al (d,6) - 2p(0)«Tf il (M) + + o(l) 
= ^( K 2 ) + o(l). 

So, by definition of f^ 2 and (|35p . 

£ e (/7jv) - E f [ K 2 - 2a{4 2) - 1}] - aE e {u N - 1) + o(l) 
= -aE 6 (u N - 1) + o(l) 
= o(l). 

For II In, write 

V5(j/Jj0 = - i){p(0)rf tl (0, e) - wj 2 k (2) }, 

where |p(0)r^ 1 — ui^k^l is bounded, and y/a(u)N — 1) converges to a limit with 
mean zero and is uniformly integrable by Lemma ITD1 So E^(IIIn) = o(l/y / a). 

For IVn, first observe by © that E^[IVNl{N a <ria}] — o(l/a). Then, note that 
we have a/Vjvl/jv a »7o} = a(^Ar — l) 2 l{iv a >)ja}/2, which is uniformly integrable and 
approaches p 2 xl by Lemma ITOl So, the desired result follows. □ 
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